48 research outputs found

    Comparative analysis of the core proteomes among the Pseudomonas major evolutionary groups reveals species-specific adaptations for Pseudomonas aeruginosa and Pseudomonas chlororaphis

    Get PDF
    The Pseudomonas genus includes many species living in diverse environments and hosts. It is important to understand which are the major evolutionary groups and what are the genomic/proteomic components they have in common or are unique. Towards this goal, we analyzed 494 complete Pseudomonas proteomes and identified 297 core-orthologues. The subsequent phylogenomic analysis revealed two well-defined species (Pseudomonas aeruginosa and Pseudomonas chlororaphis) and four wider phylogenetic groups (Pseudomonas fluorescens, Pseudomonas stutzeri, Pseudomonas syringae, Pseudomonas putida) with a sufficient number of proteomes. As expected, the genus-level core proteome was highly enriched for proteins involved in metabolism, translation, and transcription. In addition, between 39–70% of the core proteins in each group had a significant presence in each of all the other groups. Group-specific core proteins were also identified, with P. aeruginosa having the highest number of these and P. fluorescens having none. We identified several P. aeruginosa-specific core proteins (such as CntL, CntM, PlcB, Acp1, MucE, SrfA, Tse1, Tsi2, Tse3, and EsrC) that are known to play an important role in its pathogenicity. Finally, a holin family bacteriocin and a mitomycin-like biosynthetic protein were found to be core-specific for P. cholororaphis and we hypothesize that these proteins may confer a competitive advantage against other root-colonizers.</jats:p

    The Pivotal Role of Protein Phosphorylation in the Control of Yeast Central Metabolism

    Get PDF
    Protein phosphorylation is the most frequent eukaryotic post-translational modification and can act as either a molecular switch or rheostat for protein functions. The deliberate manipulation of protein phosphorylation has great potential for regulating specific protein functions with surgical precision, rather than the gross effects gained by the over/underexpression or complete deletion of a protein-encoding gene. In order to assess the impact of phosphorylation on central metabolism, and thus its potential for biotechnological and medical exploitation, a compendium of highly confident protein phosphorylation sites (p-sites) for the model organism Saccharomyces cerevisiae\textit{Saccharomyces cerevisiae} has been analyzed together with two more datasets from the fungal pathogen Candida albicans\textit{Candida albicans}. Our analysis highlights the global properties of the regulation of yeast central metabolism by protein phosphorylation, where almost half of the enzymes involved are subject to this sort of post-translational modification. These phosphorylated enzymes, compared to the nonphosphorylated ones, are more abundant, regulate more reactions, have more protein–protein interactions, and a higher fraction of them are ubiquitinated. The p-sites of metabolic enzymes are also more conserved than the background p-sites, and hundreds of them have the potential for regulating metabolite production. All this integrated information has allowed us to prioritize thousands of p-sites in terms of their potential phenotypic impact. This multi-source compendium should enable the design of future high-throughput (HTP) mutation studies to identify key molecular switches/rheostats for the manipulation of not only the metabolism of yeast, but also that of many other biotechnologically and medically important fungi and eukaryotes.G.D.A. acknowledges financial support from the “ARISTEIA II” Action of the “Operational Programme Education and Lifelong Learning” that is cofunded by the European Social Fund and National Resources (code 4288 to G.D.A.). S.G.O. acknowledges the University of Cambridge for the award of sabbatical leave that allowed him to work with G.D.A. at the University of Thessaly, Greece

    Just how versatile are domains?

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Creating new protein domain arrangements is a frequent mechanism of evolutionary innovation. While some domains always form the same combinations, others form many different arrangements. This ability, which is often referred to as versatility or promiscuity of domains, its a random evolutionary model in which a domain's promiscuity is based on its relative frequency of domains.</p> <p>Results</p> <p>We show that there is a clear relationship across genomes between the promiscuity of a given domain and its frequency. However, the strength of this relationship differs for different domains. We thus redefine domain promiscuity by defining a new index, <it>DV I </it>("domain versatility index"), which eliminates the effect of domain frequency. We explore links between a domain's versatility, when unlinked from abundance, and its biological properties.</p> <p>Conclusion</p> <p>Our results indicate that domains occurring as single domain proteins and domains appearing frequently at protein termini have a higher <it>DV I</it>. This is consistent with previous observations that the evolution of domain re-arrangements is primarily driven by fusion of pre-existing arrangements and single domains as well as loss of domains at protein termini. Furthermore, we studied the link between domain age, defined as the first appearance of a domain in the species tree, and the <it>DV I</it>. Contrary to previous studies based on domain promiscuity, it seems as if the <it>DV I </it>is age independent. Finally, we find that contrary to previously reported findings, versatility is lower in Eukaryotes. In summary, our measure of domain versatility indicates that a random attachment process is sufficient to explain the observed distribution of domain arrangements and that several views on domain promiscuity need to be revised.</p

    Improved homology-driven computational validation of protein-protein interactions motivated by the evolutionary gene duplication and divergence hypothesis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Protein-protein interaction (PPI) data sets generated by high-throughput experiments are contaminated by large numbers of erroneous PPIs. Therefore, computational methods for PPI validation are necessary to improve the quality of such data sets. Against the background of the theory that most extant PPIs arose as a consequence of gene duplication, the sensitive search for homologous PPIs, i.e. for PPIs descending from a common ancestral PPI, should be a successful strategy for PPI validation.</p> <p>Results</p> <p>To validate an experimentally observed PPI, we combine FASTA and PSI-BLAST to perform a sensitive sequence-based search for pairs of interacting homologous proteins within a large, integrated PPI database. A novel scoring scheme that incorporates both quality and quantity of all observed matches allows us (1) to consider also tentative paralogs and orthologs in this analysis and (2) to combine search results from more than one homology detection method. ROC curves illustrate the high efficacy of this approach and its improvement over other homology-based validation methods.</p> <p>Conclusion</p> <p>New PPIs are primarily derived from preexisting PPIs and not invented <it>de novo</it>. Thus, the hallmark of true PPIs is the existence of homologous PPIs. The sensitive search for homologous PPIs within a large body of known PPIs is an efficient strategy to separate biologically relevant PPIs from the many spurious PPIs reported by high-throughput experiments.</p

    The Leucine Zipper Domains of the Transcription Factors GCN4 and c-Jun Have Ribonuclease Activity

    Get PDF
    Basic-region leucine zipper (bZIP) proteins are one of the largest transcription factor families that regulate a wide range of cellular functions. Owing to the stability of their coiled coil structure leucine zipper (LZ) domains of bZIP factors are widely employed as dimerization motifs in protein engineering studies. In the course of one such study, the X-ray structure of the retro-version of the LZ moiety of yeast transcriptional activator GCN4 suggested that this retro-LZ may have ribonuclease activity. Here we show that not only the retro-LZ but also the authentic LZ of GCN4 has weak but distinct ribonuclease activity. The observed cleavage of RNA is unspecific, it is not suppressed by the ribonuclease A inhibitor RNasin and involves the breakage of 3′,5′-phosphodiester bonds with formation of 2′,3′-cyclic phosphates as the final products as demonstrated by HPLC/electrospray ionization mass spectrometry. Several mutants of the GCN4 leucine zipper are catalytically inactive, providing important negative controls and unequivocally associating the enzymatic activity with the peptide under study. The leucine zipper moiety of the human factor c-Jun as well as the entire c-Jun protein are also shown to catalyze degradation of RNA. The presented data, which was obtained in the test-tube experiments, adds GCN4 and c-Jun to the pool of proteins with multiple functions (also known as moonlighting proteins). If expressed in vivo, the endoribonuclease activity of these bZIP-containing factors may represent a direct coupling between transcription activation and controlled RNA turnover. As an additional result of this work, the retro-leucine zipper of GCN4 can be added to the list of functional retro-peptides

    Measuring the Evolutionary Rewiring of Biological Networks

    Get PDF
    We have accumulated a large amount of biological network data and expect even more to come. Soon, we anticipate being able to compare many different biological networks as we commonly do for molecular sequences. It has long been believed that many of these networks change, or “rewire”, at different rates. It is therefore important to develop a framework to quantify the differences between networks in a unified fashion. We developed such a formalism based on analogy to simple models of sequence evolution, and used it to conduct a systematic study of network rewiring on all the currently available biological networks. We found that, similar to sequences, biological networks show a decreased rate of change at large time divergences, because of saturation in potential substitutions. However, different types of biological networks consistently rewire at different rates. Using comparative genomics and proteomics data, we found a consistent ordering of the rewiring rates: transcription regulatory, phosphorylation regulatory, genetic interaction, miRNA regulatory, protein interaction, and metabolic pathway network, from fast to slow. This ordering was found in all comparisons we did of matched networks between organisms. To gain further intuition on network rewiring, we compared our observed rewirings with those obtained from simulation. We also investigated how readily our formalism could be mapped to other network contexts; in particular, we showed how it could be applied to analyze changes in a range of “commonplace” networks such as family trees, co-authorships and linux-kernel function dependencies

    Sequence Motifs in MADS Transcription Factors Responsible for Specificity and Diversification of Protein-Protein Interaction

    Get PDF
    Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, various performed bioinformatics analyses shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and network evolution

    Evolution of a New Function by Degenerative Mutation in Cephalochordate Steroid Receptors

    Get PDF
    Gene duplication is the predominant mechanism for the evolution of new genes. Major existing models of this process assume that duplicate genes are redundant; degenerative mutations in one copy can therefore accumulate close to neutrally, usually leading to loss from the genome. When gene products dimerize or interact with other molecules for their functions, however, degenerative mutations in one copy may produce repressor alleles that inhibit the function of the other and are therefore exposed to selection. Here, we describe the evolution of a duplicate repressor by simple degenerative mutations in the steroid hormone receptors (SRs), a biologically crucial vertebrate gene family. We isolated and characterized the SRs of the cephalochordate Branchiostoma floridae, which diverged from other chordates just after duplication of the ancestral SR. The B. floridae genome contains two SRs: BfER, an ortholog of the vertebrate estrogen receptors, and BfSR, an ortholog of the vertebrate receptors for androgens, progestins, and corticosteroids. BfSR is specifically activated by estrogens and recognizes estrogen response elements (EREs) in DNA; BfER does not activate transcription in response to steroid hormones but binds EREs, where it competitively represses BfSR. The two genes are partially coexpressed, particularly in ovary and testis, suggesting an ancient role in germ cell development. These results corroborate previous findings that the ancestral steroid receptor was estrogen-sensitive and indicate that, after duplication, BfSR retained the ancestral function, while BfER evolved the capacity to negatively regulate BfSR. Either of two historical mutations that occurred during BfER evolution is sufficient to generate a competitive repressor. Our findings suggest that after duplication of genes whose functions depend on specific molecular interactions, high-probability degenerative mutations can yield novel functions, which are then exposed to positive or negative selection; in either case, the probability of neofunctionalization relative to gene loss is increased compared to existing models

    Environmental sensing and response genes in cnidaria : the chemical defensome in the sea anemone Nematostella vectensis

    Get PDF
    Author Posting. © The Author(s), 2008. This is the author's version of the work. It is posted here by permission of Springer for personal use, not for redistribution. The definitive version was published in Cell Biology and Toxicology 24 (2008): 483-502, doi:10.1007/s10565-008-9107-5.The starlet sea anemone Nematostella vectensis has been recently established as a new model system for the study of the evolution of developmental processes, as cnidaria occupy a key evolutionary position at the base of the bilateria. Cnidaria play important roles in estuarine and reef communities, but are exposed to many environmental stressors. Here I describe the genetic components of a ‘chemical defensome’ in the genome of N. vectensis, and review cnidarian molecular toxicology. Gene families that defend against chemical stressors and the transcription factors that regulate these genes have been termed a ‘chemical defensome,’ and include the cytochromes P450 and other oxidases, various conjugating enyzymes, the ATP-dependent efflux transporters, oxidative detoxification proteins, as well as various transcription factors. These genes account for about 1% (266/27200) of the predicted genes in the sea anemone genome, similar to the proportion observed in tunicates and humans, but lower than that observed in sea urchins. While there are comparable numbers of stress-response genes, the stress sensor genes appear to be reduced in N. vectensis relative to many model protostomes and deuterostomes. Cnidarian toxicology is understudied, especially given the important ecological roles of many cnidarian species. New genomic resources should stimulate the study of chemical stress sensing and response mechanisms in cnidaria, and allow us to further illuminate the evolution of chemical defense gene networks.WHOI Ocean Life Institute and NIH R01-ES01591
    corecore